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30 BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

This invention pertains generally to network communications, and more 
particularly to redirecting requests for content to a particular content server in a 
group of content servers based on a cost function indicative of operational 
35 characteristics of the network, such as network performance, 

2. Description of the Background Art 

One of the most common complaints of Internet users is that it simply takes 
too much time to access information. This is a particular concern for online 
vendors conducting e-commerce transactions. Many customers or potential 
40 customers may not return to a site if access is too slow, their browser locks up in 
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the middle of a transaction, transactions are duplicated because they resubmit 
information as a result of believing that their browser has locked up during a 
transaction, or they are otherwise dissatisfied with the overall speed of accessing 
the site. While some of the problems experienced can be caused by the site itself, 
many of the problems experienced today are due to network performance. 

One of the causes of unacceptable network performance is due to the fact 
that the Internet is a network of interconnected networks. As a result, more than 
one path may exist between the location at which the data originates and the 
destination to which the data will be delivered. Typically data is transmitted in 
"packets" which include a destination address. However, there is no guarantee 
that the route assigned to the data to reach that destination address will be the 
most efficient at all times due to the dynamic nature of network capability. For 
example, a path that is free at one point in time may become congested at another 
point in time. Or, a portion of the network may become unavailable, thus causing 
the packets to be re-routed after leaving the point of origin. Further, a network 
carrier who provides a portion of the intermediate path send the packets to 
another network carrier through a network access point if the destination is not a 
customer of that carrier, thereby resulting in data loss or additional re-routing. As 
can be seen, therefore, the packets must hop from network to network to reach 
their destination. The result is that each hop can contribute to additional delivery 
delay, or even cause a loss of data. The delivery delay that may be experienced 
can have a detrimental effect on the data transmission, and can be analogized to 
the decibel (db) loss experienced when signals are transmitted through a lossy 
medium. 

Rather than pursue routing solutions, a number of companies are pursuing 
"content delivery" techniques in an attempt to enhance performance on the 
Internet. These content delivery techniques include caching, intelligent routing, 
and content optimizing, as described below. 

For example, when an internet browser sends a request for a Web page, it 
first receives a hypertext markup language (HTML) file from the site that describes 
the Web page. The browser then sends one or more requests for the embedded 
files displayed on the Web page, such as images, audio clips, or video; that is the 
content. As can be seen, this process requires two or more requests for 


information to be made from the user to the Web page, thus having a direct impact 
on delivery time. To reduce the delivery time, some "content providers" cache the 
site's content on multiple servers at different geographical (physical) locations. 
Then, when a browser sends a request for a Web page, the HTML file that is sent 
5 back is accompanied by an internet protocol (IP) address that directs the browser 
to the geographically or topologically closest server that caches the content. The 
theory behind this approach is that the path to the server that is geographically or 
topologically closest to the user will be the fastest path. 

A refinement of the foregoing approach has been to optimize content and 
10 route the content based on demand. For example, the content of a Web page can 
be separated into "objects" representing images, video, text, and audio which are 
then ranked by user demand. Based on this ranking, the objects are cached at 
servers that are geographically closest to the users demanding access to the 
objects. To do so, a real-time map of Internet traffic is maintained for determining 
15 to which servers to send the objects for caching based on demand. 

While the foregoing techniques are designed to speed delivery of content to 
users, they all suffer from a common drawback; namely, that the path to the 
geographically closest location to the user may not be the optimum path. To the 
contrary, a path to an entirely different server that may not be the geographically 
20 or topologically closest may provide for a faster response time. Furthermore, such 
techniques do not direct a user to a particular content server based on a cost 
function in a closed loop system. Therefore, a need exists for a method and 
system for providing a user with the IP address of a content server based on 
criteria other than geography or topology alone. The present invention satisfies 
25 that need, as well as others, and overcomes the deficiencies in current 
techniques. 

BRIEF SUMMARY OF THE INVENTION 
The present invention generally comprises a method and system for 
pointing a user requesting access to content on a particular content server to one 
30 of a plurality of content servers that may provide that content. More particularly, 
the content server to which the user is directed is selected based on one or more 
criteria that satisfy a cost function indicative of operational characteristics of the 
network.. The invention operates in a closed loop where additional cost 
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measurements are made each time a user is directed to a content server and that 
information becomes part of the data store for evaluating which content server 
should be selected for serving subsequent requests for content. 

For example, the user may be directed to a particular content server if the 
5 user can access that serwer more quickly than another content server. While that 
content server may also be closest to the user in physical space, the two locations 
do not necessarily have to coincide. Another example would be to direct the user " 
to a content server based on one or more criteria that are indicative of best 
performance for that user. Another example would be to use reliability, such as 
10 least packet loss, to the content server as a performance criteria. Another 
example would be to direct the user to a particular content server based on 
economic or financial cost criteria. Still another example would be to use the 
O geographical location of the content server as a performance criteria. 

-J In other words, the invention directs a user to a content server based on a 

^ 15 cost function that takes into account one or more cost criteria indicative of 
^ operational characteristics of the network.. In this regard, note that the routing to 

2 a particular location is not being changed; routing is handled in a different manner 

1, through techniques described in, for example, U.S. Patent No. 6,009,081 which is 

W incorporated herein by reference. In the present invention, the user is being 

ry 20 directed to an IP address of a particular content server. 

y By way of example, and not of limitation, the IP address of the client who is 

requesting the content and the ID of the customer whose content is being 
requested are determined. The customer's ID is mapped to one or more content 
servers where the customer's content is located. Then the client's IP address is 

25 mapped to one of those content servers based on performance data to ensure 
that the best performance is achieved. It is important to note that, while the best 
performance to a content server for a particular user is one measure of cost, it is 
not the only measure of cost that can be used to direct the user. Nor is best 
performance necessarily synonymous with the geographically or topologically 

30 closest content server. Instead, the content server is selected based on one or 
more criteria that may include an estimate of the best performance, such as 
fastest response, and which may or may not translate to the geographically or 
topologically closest content server. Based on one or more cost factors such as 
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performance measurements, the client's IP address, the customer's ID. and the 
locations of the content servers, the present invention returns the IP address of 
the content server satisfying the cost function for that client. According to an 
optional aspect of the invention, all clients using the same DNS sen/er would be 
treated as satisfying the same criteria for ease of implementation. 

Further objects and advantages of the invention will be brought out in the 
following portions of the specification, wherein the detailed description is for the 
purpose of fully disclosing preferred embodiments of the invention without placing 
limitations thereon. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention will be more fully understood by reference to the following 
drawings which are for illustrative purposes only: 

FIG. 1 is a system and data flow diagram showing configuration and 
operation of an embodiment of the present invention. 

FIG. 2 is a diagram depicting a geographical inference method according to 
the present invention. 

FIG. 3 is a diagram depicting a circular intersection locating technique 
according to the present invention. 

FIG. 4A through FIG. 4F are diagrams depicting an alternative embodiment 
of the circular intersection technique shown in FIG. 3 which uses linear 
approximations. 

FIG. 10 is a diagram depicting an intelligent inference technique according 
to the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

Referring first to FIG. 1 , a schematic diagram of a network configuration 10 
according to the invention is shown in the context of the Internet. FIG. 1 also 
depicts the preferred flow of information through the Internet according to the 
present invention. It will be appreciated, however, that the present invention is 
applicable to any interconnected network in which there are at least two servers 
within the network that contain the same content for a site that is being requested 
by a client. In the example that will be described, two content servers and three 
private network access points (PNAP) are shown. However, the invention can be 
applied to any number of content servers and PNAPs. It will further be 



appreciated that the invention is not linnited to use in connection with PNAPs; to 
the contrary, the invention is applicable to any network or networks to which at 
least two content servers are connected. The invention is described in the context 
of PNAPs as part of the preferred systenn in which the invention can be 
5 implennented. 

The premise of the present invention is that a client 12 desires to access 
content associated with a customer's Web site 14. By "client" we mean a 
computer that accesses a network such as the Internet through a connection, 
such as a PNAP. By "customer" we mean a network server, such as a Web site 
10 server, with which there is associated two or more content servers that may 
provide the desired content. In addition, by "content server" we mean a server 
with a unique internet protocol (IP) address, a group of servers with a common IP 
Q address, multiple data servers or the like, wherein the server(s) can be configured 

m such that selection can be made from among two or more server addresses. 

1^ 15 In the example described herein, which represents an embodiment of the 

^ system configuration in which the invention would be used, it is assumed that 

5 there is at least one PNAP, and preferably two PNAPs, connected to the Internet. 

1.^ It is also assumed that addressing is IP, that we are using HTTP and DNS, and 

W that the customer will have two or more content servers connected to the same or 

fj 20 different PNAPs. For example, one content server could be connected to a first 
S PNAP, a second content server could be connected to a second PNAP, a third 

content server could be connected to a third PNAP, and so on. For purposes of 
this example, also assume that the IP address of client 12 is 10.0.0.1, that the 
uniform resource locator (URL) of Web site 14 is www.superperformance.com , 
25 that the IP address of the customer is 197.172.20.14. that the customer's identity 
(ID) is "superperformance", and that the client desires to access the home page 
www.superperformance.com which is "index.html". Again, however, the invention 
is applicable to any network configuration with multiple content servers, as well as 
to any IP service or addressing protocol. 
30 In the example shown, client 12 will enter www.superperformance.com as 

the URL in their browser. At step 100, the client's browser sends a request for 
www.superperformance.com who, as stated before, is assumed to be a customer 
with content than can be accessed on two or more content servers located within 
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one or more PNAPs. The request is received by a domain name system (DNS) 
server 16 tliat is maintained either by the client or by the client's internet service 
provider (ISP). At step 102. DNS server 16 returns 197.172.20.14 as the IP 
address for superperformance.com. At step 104, the client's browser sends a full 
5 page request to 197.172.20.14 which connects with the customer's Web site 14. 

At step 106, Web site 14 returns the full html page, which describes the 
embedded content as well as a URL that includes a redirector host name. The 
redirector host name is a new protocol that is unique to the present invention, and 
which contains certain information that is used for determining which content 

10 server the client should access. In this example, the redirector host name would 
have the following basic format: client-ip.customer-id.lookup.pnap.net where client 
IP would correspond to 10.0.0.1, customer-id (e.g., customer identity) would 
correspond to "superperformance", and lookup.pnap.net would correspond to the 
address of a redirector DNS bridge (which is also a DNS server) which will be 

15 described below. Web site 14, which will know the client's IP address, will 
generate the redirector host name as part of the present invention. 

The more complete form of the URL for the content in this example would 
be http://1 0-0-0-1 .superperformance.lookup.pnap.net/index.html . It will be 
appreciated that other formats conveying the desired information could be used as 

20 well without departing from the present invention. For convenience, however, we 
have chosen to build a new layer on the existing DNS protocol. In other words, 
we have developed a new use of DNS. Instead of conventional IP to name 
mapping, we are creating an imaginary host name in a format that allows us to use 
DNS as the transport mechanism. The host name conveys additional information 

25 that is used for accessing the redirector DNS bridge and obtaining the IP address 
of the content server which provides best performance to the user. 

With this in mind, at step 108 the client's browser then sends the redirector 
host name to DNS server 16 which will not recognize the redirector host name. 
DNS server 16 will know, however, to make a recursive query to redirector DNS 

30 bridge 18 at step 110 because "lookup.pnap.net" will be contacted as a matter of 
the normal DNS lookup mechanism. 

At step 112 redirector DNS bridge 18 will send the client's IP address 
10.0.0.1 and the customer ID "superperformance" to redirector 20. At step 114, 
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using information in a dynamic database 22 or an alternative data store (including 
DNS bridge 18 itself), redirector 20 returns the IP address for the content server 
based on the client's IP address, the customer's ID. and cost information 
associated with accessing content servers containing the requested content. In 
5 the example shown, there are two content servers 24, 26 which replicate the 

requested content. At step 116, redirector DNS bridge 18 sends the IP address of 
content server 22 to DNS server 16. At step 118, DNS server 16 sends the IP 
address of content server 24 to client 12. At step 120, the client's browser then 
contacts content server 24 with the request to serve up the embedded content, 
10 and the requested content is sent to client 12 at step 122. 

Note also that FIG. 1 shows two "spanners" 28, 30 associated with content 
servers 24, 26 respectively. Spanners 28, 30, which in this example are part of 

O PNAPs 34, 36. respectively, monitor and collect the cost associated with the 

yd 

im client's request to the content server (e.g., latency, performance, etc.) The cost 

."^ 15 data is used to determine which content server will be selected by redirector 20 in 
response to subsequent requests for content. In the example thus described, this 

r S ; 

would take place after the client's browser contacts content server 24 and before 
content server 24 serves up the requested content. The performance 
m measurements can be stored in database 22 or alternative data store or even 

20 DNS bridge 18. As stated before, performance is one measure of cost which can 
Q be used to determine the content server to which the client will be directed. Other 

criteria can be used as well. 

As can be seen, therefore, there are essentially three components to the 
process. One is the determination of the IP address of the client who is 

25 requesting the content and the ID of the customer whose content is being 

requested. The second is mapping the customer's ID to one or more content 
servers where the customer's content can be accessed. The third is mapping the 
client's IP address to a particular content server based on one or more criteria 
indicative of operational characteristics of the network, such as network 

30 performance. It is important to note that network performance is not necessarily 
synonymous with the fastest path, and that the fastest path is not necessarily 
synonymous with the geographically or topographically closest content server. 
Instead, the content server is selected based on an evaluation of cost, which may 
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or may not translate to the fastest path, shortest response time, least packet loss, 
lowest economic cost of service, or the geographically or topographically closest 
content server for that client. The foregoing list is not all inclusive, but is provided 
by way of example of various measures of cost. Based on one or more measures 
of cost, the client's IP address, the customer's ID, and the IP addresses of the 
content servers, the redirector method of the present invention returns the IP 
address of the content server satisfying the cost function for that client. 

As an alternative to using the form of the redirector host name described 
above, a simplified form could be used. Referring to the previously example, the 
simplified form would be"superformance.lookup. pnap.net". Note that the client's 
IP address has been dropped. In this embodiment of the invention, redirector 20 
would base its decision on the IP address of DNS server 16 instead the client's IP 
address. In this way, all clients who are using the same DNS server would be 
directed to the same content server for "superperformance.com". 

It will be appreciated, therefore, that the cost measurements should be 
continuously updated. It will also be appreciated that these measurements may 
contain information concerning, for example, latency and reliability between a 
client and the content servers. In the exemplary embodiment of the invention 
described herein, latency measurements are obtained by employing spanners 28, 
30 for populating and updating a data store such as database 22. A "spanner" as 
used herein is a software process which measures and records, for example, 
costs such as latency between a client and a destination in the network. Note that 
the preferred location of a spanner is in proximity to a content server so that the 
spanner can monitor the performance of the client's access to the content server. 
If a spanner were located elsewhere, then it would not be able to determine the 
cost characteristics of the request and transfer of content. 

Referring to FIG. 1 , assume for example that client 12 is connected to a first 
PNAP 32, and that content servers 24, 26 are connected to second 34 and third 
36 PNAPs, respectively. In order to determine the cost associated with accessing 
a content server, data is collected between client 12 and the PNAPs 34, 36. For 
example, using the TCP/IP protocol, this is preferably done passively by watching 
the client's communications with destinations in a PNAP and measuring the time 
between the "syn ack" and the "ack" messages. The relative measurements 


between PNAPs can then be used to determine which content server can be 
accessed by the client with the least amount of network latency. It will be 
appreciated, however, that other handshakes may be employed if a protocol other 
than TCP/IP is used or developed. It will also be appreciated that there may be 
5 other suitable methods for making such measurements, including active methods 
such as pinging, that can be used as well, and that the invention is not limited to 
the foregoing passive method which is given by way of example as the preferred 
method. 

Note also that cost measurements between all clients and all network 
10 locations (e.g., PNAPs) containing content servers may not be available. For ease 
of implementation, one solution would be to obtain one or more cost 
measurements for a client using a particular DNS server and then treat all clients 
O using that same DNS server as if the costs were equivalent. Still another 

m approach would be to obtain one or more cost measurements and then correlate 

15 those measurements with other clients who may be similarly situated (e.g., 
^ geographically). Still another approach would be to use initial randomization 

y| 

^ associated with directing a client to a particular content server. For example, 

initially we may direct the client to PNAP 34 and content server 24. At that point in 
yi time, a cost measurement such as latency can be made. The next time that the 

20 client requests content we may force the client to go to PNAP 36 and content 
server 26 where a measurement has not yet been made. In this regard, it will be 
appreciated that complete information can be obtained by iterating the 
measurements over all possibilities. It will also be appreciated that other 
approaches to making cost measurements may be suitable as well. 
25 It should be apparent that randomization may be useful for normal system 

operation as well as initial operation of the system. As can be seen, we 
essentially need up-to-date cost measurements from each cluster or set of IP 
addresses (rather than each IP address) to each content server (i.e., to PNAP in 
which a content server is located). Therefore, even though we may have a 
30 measurement for PNAP 24, we may send the client to PNAP 36 instead to get a 
measurement. The amount of time a client is sent to a different PNAP would be 
arbitrarily set, such as ten percent of the time. Further, suppose that several 
prefixes want to access "superperformance.com" and that one thousand 
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customers in a prefix want to contact the site. The randomization could be applied 
to clusters or sets of IP addresses, such as prefixes. This allows us to strike a 
balance between exploration (e.g., making cost measurements) and exploitation 
of information (e.g., designating a content server). Otherwise, if the client is 
always directed to what was previously determined to be the least cost (e.g., best 
performing) content server for that client, we would never obtain measurements 
for other content servers for that client. This "overexploitation" of measurements 
could lead to less than desirable performance since a different content server may 
satisfy the cost function for that client. On the other hand, if we are always 
exploring (making measurements), we will never exploit the measurements. 

Accordingly, every time a client accesses a content server, the client 
provides performance data about that content server that can be observed by a 
spanner to measure and record performance, as well as provides data which can 
be used as an estimate of performance for the cluster or set of IP addresses 
containing the client's IP address. Therefore, it is possible to learn how long it 
takes for the client (and the group of addresses) to access the destination. When 
the invention directs the client to the content server that gives best performance, 
the system also obtains information about the client. It will be appreciated that, by 
sending the client to a different content server part of the time, the system adapts 
over time. 

While the preferred approach to obtaining and maintaining the cost 
measurement data has been described, it may not be practical to monitor the 
performance or other criteria from every client's IP address or every set of 
addresses (e.g., such as prefixes) to a PNAP due to the large number of 
measurements that are required. For example, there are approximately seventy 
thousand prefixes in use; therefore, the number of measurements that would be 
required would be seventy thousand times the number of PNAPs. If prefixes are 
close together geographically, however, an inference can be made that physically 
proximate prefixes will experience similar network performance. In other words, a 
predictive characteristic can be added to the operation of the redirector in order to 
reduce the number of measurements that are required. A particularly suitable 
technique is to employ circular intersection to identify the geographical location of 
clients and then group IP addresses based on correlation of locations. 


For example, referring to FIG. 2, suppose that there are multiple clients with 
sets of IP addresses (e.g.. prefixes) P1, P2 and P3 connected to a PNAP 200. 
Using circular intersection techniques according to the present invention, we can 
determine the physical location of a set of IP addresses. In the preferred 
embodiment of the invention, this can be accomplished by first obtaining latency 
measurements between a client and PNAP 200 whose latitude and longitude is 
known. The minimum latency measurement in the set generally represents the 
most accurate measurement for purposes of circular intersection according to the 
invention. For example, if one-thousand latency measurements are obtained, all 
of the measurements except for the minimum measurement would be 
disregarded. The minimum latency measurement can then be converted to a 
distance equivalent by multiplying the measurement by the speed of light and 
dividing by two in the case of a round trip measurement such as is obtaining by a 
ping. Note, however, that this computation may overshoot the actual distance to 
the client in most networks due to the use of optic fiber in the network and the fact 
that light actually travels at a lower speed in fiber. However, only an estimate 
needs to be made for purposes of circular intersection according to the present 
invention. 

In the case of FIG. 2, the minimum latency measurements from PNAP1 200 
to P1 and P2 are 53 ms and 66 ms, respectively, and the minimum latency 
measurements from PNAP 202 to P1 and P2 are 105 ms and 100 ms, 
respectively. Using this information, circles can then be drawn where the PNAP 
location is the center of the circle and the distance equivalent of the latency is the 
radius of the circle. In this regard, it will be appreciated that shorter return times 
result in smaller (i.e.. tighter) circles, thus providing more precise measurements. 
It will also be appreciated that any test that measures the time that it takes for a 
signal or data to move to a location and back can be employed without departing 
from the invention, and that one way of making such a measurement is by sending 
a TCP/IP "ping" to the client from the PNAP. 

Referring also to FIG. 3, by taking similar measurements from multiple 
PNAPs 200, 202. 204, several circles such as circles 300, 302, 304 can be drawn 
and the intersection of the circles will correspond to the approximate physical 
location of the sets of IP addresses (e.g., a prefix), although the exact location will 
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not be known for a number of reasons, including the fact that light travels at a 
slower speed through fiber. In this example, the intersection of the circles is the 
approximate location of P1 . An inference can then be made for sets of IP 
addresses that are physically close, but for which no actual measurements have 
5 been made; namely, that clients will experience similar network conditions. 

Therefore, in the example of FIG. 2, assume that measurements were taken for 
P1 and P2. but not for P3. If P3 is physically close to P1 and P2, a circle of 
inference 206 can be drawn for P3. This assumes, however, that set P3 was 
located one point in time so we know that set P3 is in the circle of inference for 
10 sets P1 and P2. 

It should be apparent from the foregoing discussion that the sets of IP 
addresses can vary in size and, in addition, that once a circle of inference is drawn 
around sets of IP addresses, those sets can be used for drawing new circles of 
inference. For example, suppose that P1 , P2 and P3 are sets of one. Once the 
15 circle of inference 206 is drawn around those sets, their combination can form a 
ffi new set P4 (not shown). Now. assume that P5 (not shown) falls within the circle 

S of inference for set P4. A new inferred set can then be created, which is the 

1, combination of P4 (P1 , P2 and P3) and P5. Therefore, we can create new sets 

Lq and sets of sets using this method. 

hi 20 Note that it is possible to implement circular intersection more efficiently by 

employing an approximation technique. Determining the intersection of a set of 
circles as the number of circles increases using conventional computational 
techniques can become computationally complex since the relationship between 
time that it takes to compute the intersections and the increased number of 
25 intersections is not linear. A linear approximation, on the other hand, is the fastest 
way to implement circular intersection even though it is not as accurate. This can 
be accomplished, for example, by summarizing the previous intersections with an 
intersection object. The intersection object is then intersected with a subsequent 
circle or another object representing that circle. Each time that an intersection is 
30 determined, the intersection is summarized with a new intersection object that is 
used for purposes of subsequent intersections. 

For example, suppose that we have three circles 300, 302 and 304 as 
described before. A linear approximation of the intersection can be determined as 
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follows. First, the intersection 306 of two of the circles, such as circles 300. 302 
would be determined as shown in FIG. 4A. Next, the area of intersection can be 
circumscribed by an intersection object 308. which is preferably rectangular in 
shape as shown in FIG. 4B. The intersection object 308 should represent the 
smallest bounding box that encloses the intersection. Referring also to FIG. 4C, 
the estimated intersection is the cross-hatched 310 area inside intersection object 
308. Now, when the third circle 304 is drawn, the intersection of circle 304 and 
intersection object 308 can be determined. In the example shown in FIG. 4D, the 
intersection of circle 304 and intersection object 308 is shown as the cross- 
hatched area 31 2^ However, since the intersection still involves taking curvature 
into account, computational efficiency can still be optimized. This can be 
accomplished by replacing circle 304 with a rectangular-shaped object 
representing circle 304. such as square 314 shown in FIG. 4E which 
circumscribes circle 304. The intersection is shown as the cross-hatched area 
316 bounded by a new rectangular-shaped intersection object 318. 

Intersection object 318 would then be used for the next intersection with a 
circle by again replacing the circle with a rectangular-shaped object representing 
that circle. As can be seen, therefore, each intersection creates a new 
intersection object that can be used for a subsequent intersection. 

In order to reduce computational complexity, note in FIG. 4E that square 
314 has been aligned with intersection object 308 such that corresponding sides 
are parallel. This ensures that intersection object 318 is rectangular-shaped. To 
further reduce computational complexity, it is also possible to rotate the coordinate 
system of the intersecting circles so that the longitudinal axis through intersection 
object 308 is oriented vertically and aligned with the vertical axis in a normal 
Cartesian coordinate system as shown in FIG. 4F. Additional refinements include, 
but are not limited to: 

(1 ) if one circle contains another circle, discarding the larger circle to 
provide a tighter intersection; 

(2) if more than two circles in a set are to be intersected and the circles 
are similar in size, starting with the two circles that are the farthest apart; 

(3) if more than two circles in a set are to be intersected and the circles 
are different in size, starting with the two circles having the smallest radii to 
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provide the tightest intersection. 

Therefore, the above-described method summarizes previous intersections 
with an intersection object. The intersection object is then intersected with a 
subsequent circle or an object representing the subsequent circle. While the 
5 intersection objects are preferably rectangular-shaped as described above, they 
could alternatively have other shapes that are also polygons. The method 
reduces computational complexity and increases computational speed by 
employing a linear computation technique, at the expense of accuracy, rather than 
employing a technique that is not linear. In other words, if n represents the 
10 number of circles being intersected, the conrTputational complexity for exact 

intersection is O(n^) whereas it is 0(n) in the approximation method described 
above. 

Accordingly, it can be seen that a spanner measures performance 
characteristics while circular intersection estimates the physical location of a client. 
^ 15 Accordingly, circular intersection is an operational tool that can be used to speed 
© up populating the store of cost measurements when actual measurements are not 

2 available; it is not, however, an essential part of the invention. 

1,^ Referring now to FIG. 10. an "intelligent inference" method according to the 

^ invention will be described which is more accurate than drawing a circle of 

ry 20 inference around physically proximate sets of IP address as was described 
y previously in connection with FIG. 2. In FIG. 4, let Ca represent a first content 

server, let Cb represent a second content server, let S^, S2, S3 and S4 represent 
sets of IP addresses, let L represent latency, and let D represent distance. We 
can use a spanner to obtain latency measurements from content server Ca to 
25 (Lia), S2 (Lsa) and S3 (Lag), and use circular intersection to determine the distances 
between S^ and S4 (DiJ. S2 and S4 (D24). and S3 and S4 (D34). With this 
information, we can obtain a weighted average of latency measurements to infer 
the latency measurement L4a from content server Ca to S4 as follows: 

r ^ 
1 


1 1 1 

•+ — + • 


vA4 A4 A4y 

30 By taking the weighted average in this manner, we obtain an approximation of the 
latency from content server Ca to S4. Note also that inferences can be made in 
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other ways, including, but not limited to, the following: 

1 . Monitoring sets of IP addresses and then treating all IP addresses in 
the set in the same fashion. 

2. Aggregating prefix groups based on similarities and treating all such 
5 aggregated prefix groups in the same fashion. 

3. Disaggregating prefix groups into subgroups that have similar 
characteristics and treating the subgroups in the same fashion. 

4. Placing IP addresses into sets based on some criteria of similarity 
such as, but not limited to, performance, geographical location, and other 

10 correlation factors, and treating all IP addresses in the sets in the same fashion. 

In other words, inferences are ideally made based on sets of IP addresses. 
A set can be a single IP address (e.g., a set of one), a group of IP address in a 
BGP4 prefix if the prefix is small, the union of two sets wherein the two sets are 
geographically close, the union to sets wherein the two sets experience similar 

15 performance in relation to content servers, and the like. 

Therefore, it will be appreciated that efficient implementation of the present 
invention requires cost measurements about the network that can used to 
determine if directing a client to a particular content server will satisfy a cost 
function, and it will further be appreciated that the decision may be irrespective of 

20 the actual physical location of the client and the content server. Initially, however, 
cost measurements may not be available for all clients and content servers. As 
are result, some amount of randomization or estimation may be necessary until 
the system is trained and the store of performance data is populated. In addition, 
knowledge of the physical location of the client and/or sets of IP addresses may 

25 be helpful for drawing an inference that proximate physical locations may 
experience similar network conditions. Such inferences can be used as an 
alternative to obtaining measurements from each client IP address to each PNAP 
or content server. Note, however, that some measurements, such as latency, 
need to be updated frequency due to changes in network conditions whereas 

30 geographical locations of clients are static. Therefore, inferences should be drawn 
close in time to when measurements of a dynamic nature are made. 

It will also be appreciated that the system of the present invention will 
automatically balance network load among PNAP's. Another load balancing 
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option would be to provide the customer with a software interface that allows thenn 
to report load on the content servers so that redirection can also be based on 
balancing server load. 

It will be appreciated that the method and system would preferably be 
implemented using a programmed data processor such as a computer and 
associated software to carry out the functions described herein. For example, 
DNS bridge 18. redirector 20, and spanners 28. 30 would be implemented as 
software running on a programmed data processor such as a conventional 
computer. Those skilled in the art will appreciate that the actual code to 
implement the invention as described herein can be written for various platforms 
using conventional programming techniques. Accordingly, the details of the code 
are not presented herein. 

As can be seen, therefore, the invention provides for obtaining cost 
measurements of various types associated with serving a request for content to a 
user and directing the user to a particular content server based in satisfaction of a 
cost function indicative of operational characteristics of the network. The invention 
operates in a closed loop wherein an additional cost measurement is made each 
time a user is directed to a content server and that information becomes part of 
the data store for evaluating which content server should be selected for serving 
subsequent requests for content from that content server. 

Although the description above contains many specificities, these should 
not be construed as limiting the scope of the invention but as merely providing 
illustrations of some of the presently preferred embodiments of this invention. 
Thus the scope of this invention should be determined by the appended claims 
and their legal equivalents. Therefore, it will be appreciated that the scope of the 
present invention fully encompasses other embodiments which may become 
obvious to those skilled in the art. and that the scope of the present invention is 
accordingly to be limited by nothing other than the appended claims, in which 
reference to an element in the singular is not intended to mean "one and only one" 
unless explicitly so stated, but rather "one or more." All structural, chemical, and 
functional equivalents to the elements of the above-described preferred 
embodiment that are known to those of ordinary skill in the art are expressly 
incorporated herein by reference and are intended to be encompassed by the 
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present claims. Moreover, it is not necessary for a device or metliod to address 
each and every problenn sought to be solved by the present invention, for it to be 
encompassed by the present claims. Furthermore, no element, component, or 
method step in the present disclosure is intended to be dedicated to the public 
regardless of whether the element, component, or method step is explicitly recited 
in the claims. No claim element herein is to be construed under the provisions of 
35 U.S.C. 112, sixth paragraph, unless the element is expressly recited using the 
phrase "means for." 


-18- 


